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Draft Genome Sequence of Lactobacillus crispatus EM-LCI, an Isolate 
with Antimicrobial Activity Cultured from an Elderly Subject 
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Here we report the 1.86-Mb draft genome sequence of Lactobacillus crispatus EM-LCI, a fecal isolate with antimicrobial activity. 
This genome sequence is expected to provide insights into the antimicrobial activity of L. crispatus and improve our knowledge 
of its potential probiotic traits. 
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Lactobacilli are subdominant members of the human gut mi- 
crobiota and encompass a considerable number of different 
species that display a relatively high degree of diversity ( 1 ) . A num- 
ber of Lactobacillus species are considered to have probiotic prop- 
erties, offering beneficial roles in maintaining the health status of 
the host (2). Lactobacillus crispatus can persist in the gastrointes- 
tinal tract (3) and is among the most prevalent species of the 
Lactobacillus-dommated human vaginal microbiota (4). L. crispa- 
tus EM-LCI was isolated from a fecal sample obtained from an 
elderly subject and showed H 2 0 2 -independent antimicrobial ac- 
tivity in cell-free supernatant. The genome sequence of this strain 
will provide a genomic platform for investigation of L. crispatus 
antimicrobial activity and will elucidate the genetic basis for its 
potential probiotic traits. 

The genome was sequenced at Macrogen (Seoul, South Korea) 
on the Illumina platform, generating a paired-end library contain- 
ing 35,397,530 reads of 101 bp. The data were assembled into 54 
scaffolds using the de novo assembly program Velvet (5). MAUVE 
was used to reorder scaffolds based on the reference genome of 
L. crispatus ST1 (6). tRNAs were identified using tRNA-scan SE 
(7). Protein coding regions were predicted using Metagene (8), 
and annotation was subsequently performed on the basis of 
BLASTP (9) analysis against a nonredundant protein database 
(nr) provided by the National Center for Biotechnology Informa- 
tion (NCBI). This automated annotation was then manually cu- 
rated in Artemis ( 1 0 ) . A functional classification was applied using 
the Clusters of Orthologous Groups (COG) database (11). rRNA 
operons were detected on the basis of BLASTN searches and an- 
notated manually. 

The draft genome sequence of L. crispatus EM-LCI consists of 
1,862,161 bp, with a G+C content of 37%, which is similar to 
those of other Lactobacillus genomes (6, 12). All predicted genes, 
proteins, enzymes, and their functions are putative. The draft ge- 
nome sequence contains 1,827 protein-encoding genes, repre- 
senting a coding density of 87.6%, with an average gene length of 
893 bp. No rRNA locus was assembled due to exclusion of re- 
petitive sequences from the assembly. Forty-five tRNA genes 



and 27 transposase-encoding genes were identified. No com- 
plete prophages were found in the genome sequence, but a num- 
ber of single open reading frames (ORFs) with similarity to phage 
genes were identified. Genes encoding a helveticin-like bacterio- 
cin and a class II bacteriocin-like product were also identified. 

Functional classification of the predicted genes by COG (11) 
showed that 1,383 predicted protein-encoding genes (75.7%) 
were homologous to members of known gene families, including 
155 (8.5%) identified as "general function prediction only" and 
130 (7.1%) with poorly characterized gene functions designated 
"function unknown." 

Nucleotide sequence accession numbers. This draft whole- 
genome shotgun project has been deposited in GenBank under 
the accession no. AXLM00000000. The version described in this 
paper is the first version, AXLM0 1000000. 
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